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Abstract 

Minimizing the relative inertia of a statistical group with respect 
to the inertia of the overall sample defines an unique point, the in- 
focus, which constitutes a context-dependent measure of typical group 
tendency, biased in comparison to the group centroid. Maximizing the 
relative inertia yields an unique out- focal point, polarized in the reverse 
direction. This mechanism evokes the relative variability reduction 
of the outgroup reported in Social Psychology, and the stereotypic- 
like behavior of the in-focus, whose bias vanishes if the outgroup is 
constituted of a single individual. In this picture, the out-focus plays 
the role of an anti-stereotypical position, identical to the in-focus of 
the complementary group. 

Keywords: anti-stereotype, central tendency, context-dependent po- 
larization, Huygens principles, metacontrast ratio, outgroup homogeneity, 
relative dispersion, stereotype bias, variability reduction 

1 Introduction 

The expression "typical features of a group" is ambiguous: it might either 
refer to a multivariate indicator of group central tendency, that is to an un- 
biased group centroid, or to the distinctive, unique characteristic tendencies 
of the group, contrasting the features observed in the other groups or in the 
complete sample, in which case it constitutes a caricatural stereotype. 

A stereotype summarizes the features of a whole group into a single 
profile (variability reduction) . Its profile is generally distinct from the group 
centroid, pushed away from the overall centroid of the whole population or 
context under consideration (bias or polarization). 
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These two characteristics of stereotypy have been largely reported in 
Social Psychology, in particular with reference to the outgroup: 

1) people tend to minimize the differences between outgroup members, 
while being inclined to perceive their own group as made of an het- 
erogeneous set of unique individuals (outgroup relative homogeneity): 
"They all look alike but we don't". See e.g. Quattrone and Jones 
(1980), Taylor et al. (1978), Park and Hastie (1987), Mullen and Hu 
(1989), and references therein. 

2) people tend to exaggerate the typical traits of the outgoup, and to 
enhance the contrasts between members of different groups (stereotype 
polarization or bias effect). See e.g. Turner (1975), Hopkins and Cable 
(2001), Hogg et al. (2004), Realo et al. (2009), and references therein. 

This paper presents a simple, formal, principled mechanism linking the 
two aforementioned aspects of stereotypy. Specifically, we show that mini- 
mizing the relative group dispersion defines an unique point in the feature 
space, called the in- focus (Theorem [I]) , manifesting the exaggeration or po- 
larization effect expected from a stereotype (Theorem [2]) . The polarization 
can be qualified as fair, in the sense it vanishes for a group formed of a 
single individual, which thus coincides with its own stereotype. Increasing 
the group dispersion increases the polarization effect, as shown by (jl]) and 
©• 

Furthermore, maximizing the relative group dispersion yields another 
unique point conjugate to the in-focus, the out-focus, bearing the character- 
istics of an anti-stereotype or antitype. The group out-focus coincides with 
the in-focus of the complementary group (Theorem [3]) , as illustrated on the 
U.S. Congressmen data (Section 12. 41) . 

Section [3] attempts to justify the position of the in-focus in a decision- 
theoretical setup, and underlines the connections with the meta- contrast 
model and simulations of Salzarulo (2006), whose work initially triggered 
the present research. 

2 The formal model 

2.1 Definitions and notations: Huygens principle 

Consider a totality n individuals, denoted i = 1, ... ,n, characterized by a 
multivariate profile of p features with k = 1, . . . ,p. These features define 
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a squared Euclidean distance between individuals 



Dij — ^ ~] ( x ik %jk) *> 3 



=1 



For generality sake, we assume that individuals possess weights fa > 0, with 
Sr=i /» = 1- The un if° rm weighting obtains as fa = 1/n. Also, consider a 
profile a 6 M p , which might or might not correspond to the features of an 
existing individual. Huygens principle consist of the identities 
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A / = E fi D * = A f + ^ A / = 2 ^ Aj (2) 

i=l ij 

where A^ is the inertia relative to the reference point a, Df a is the squared 

distance between the point a and the centroid Xf = ^ faxi, and Aj = A^ 
is the inertia relative to the centroid. In particular, ([2]) shows that A^ 
attains its minimum A t for a = x t . 

Now consider a group g of individuals. A group is specified by the 
individuals it contains, that is, in full generality, by a distribution > 
with YH=i 9i = !• I n m ost situations, the support of g (that is the set of 
individuals for which g% > 0) is a strict subset of the complete set of the n 
individuals, but this restriction is not necessary. We however assume that 
the group centroid x g = ^^giXi differs from the overall centroid Xf, that is 
Df g > 0. As before, A^ takes on its minimum value A g for a = x g . 

2.2 The relative dispersion 

Definition 1 (Relative dispersion) The relative dispersion of group g in 
context f, relatively to the reference point a is 

A a A + D 
5(a) = 5(a\g, = == 9 " a . (3) 

v ; A^ Af + Df a w 

The relative dispersion measures the disparity or heterogeneity in group 
g, in units determined by the overall heterogeneity, as assessed from some 
reference point of view a. Varying the point of view enables to tune, within 
some limits, the apparent, perceived relative heterogeneity of the group g. 
Remarkably enough, the relative dispersion 5(a) is finite everywhere, and 
possesses an unique minimum a_ as well as an unique maximum a + : 
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relative dispersion 8(a) 
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Figure 1: Abscissa: absolute coordinates a and relative coordinates e, along 
the line (jlj) passing through the group centroid x 5 (associated to the black 
objects) and the overall centroid Xf (associated to the black and white ob- 
jects). Ordinate: the relative dispersion 5(a) is minimum for the in- focus 
points a_, maximum for the out-focus points a+, and tends to unity as 
e — > ±00. 



Theorem 1 (In- and out-focus points) Both the in-focus point a min- 
imizing 5(a), and the out-focus point a + maximizing 5(a) are unique, and 
given by a± := a(e±), with 

a(e):=x f + e(x f -x g ) e± = ± y/% + ^f D fg) ( 4 ) 

where bf g := Af — A g — Df g . 

By theorem [H the in- and out-focus points lie on the line joining centroids 
Xf and x g . In-focus polarization occurs if a_ lies "on x g side", that is if 
e_ < —1, as in Figure HJ Similarly, out-focus polarization occurs if a+ lies 
"on Xf side", that is if e + > 0. Theorem [2] insures this to be always the 
case. 

Theorem 2 (Polarization) In- and out-focus points a± fall outside the 
interval [x g ,xj], as in figure^ Specifically, 

e < -1 < e + < (5) 

where both inequalities are attained iff A g = 0, as in the case of a singleton, 
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dispersion-free group. For small A g , 

z- = ~1-[ A l n ]A 3 + 0(A^) (6) 
A .f + D fg 

Proofs: let e := — a|| be fixed, and consider the angle a between x g 
and a as measured from Xf. By the cosine theorem, 



A g + D ga _ A g + D f g + e 2 -2 v mJgecosa 
A f + D fa ~ Af + e 2 

which is maximum for a = 180° and minimum for a = 0°. In both cases, 
the extremum a lies on the line passing through x g and xj, i.e. is of the 
form a(e) in flU, with relative dispersion (A 9 + Df g (e + 1) 2 )/(A/ + Df g e 2 ). 
Setting to zero its derivative in e yields Df g e 2 — bf g e — Aj = 0, with solutions 
e± (with the correct sign) given by (jU). Furthermore, it is easy to show that, 
for Df g and Af fixed, both expressions e_ and e+ are decreasing in A g , and 
take on their maximum value §5§ for A g = 0. □ 

2.3 Other expressions 

The following features-based expression may be computationally useful: 

a± = s ^a i (e±) Xi a^e) := (1 + e)/j - egi . (8) 

i 

a(e) is a signed distribution, that is normalized to unity but not necessarily 
non- negative. 

Also, twice application of Huygens decomposition (or direct manipula- 
tion of the features) demonstrates the distance-based identities 

D fg = ~2 XV' ~ !l ' )[ - f > ~ 9o) D H 1 'In = M ~ 9j) D iJ ■ 

ij ij 

Finally, define the squared polarization ratio as 

^ = (e+ _^ = (1 + ^±^ f _4^>l 
D fg D fg D f g 

which shows the polarization to increase with each of the inertias A^ and 
A g . In particular, the polarization ratio takes on its minimum value unity 
iff Ag = Af = 0, and a_ = x g iff Ag = 0, as shown by (0). 
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2.4 Illustration: U.S. Congressmen 



In the legislature 1984, a number of p = 16 "key votes" from n = 435 US 
Congressmen, comprising ur = 168 Republicans and no = 267 Democrats, 
have been coded as 1 ("yea") or ("nay")0. Missing values have been 
replaced by the average value inside the affiliated political group. 

The overall, Republican and Democrat distributions read respectively as 

1 I(i € R) _ I(i € D) 

fi = - g% = 9i = • (9) 

n ur nr> 

where 1(A) denotes the characteristic function of event A. After computa- 
tion of the squared Euclidean distances (pQ) from the Congressmen votes, a 
classical multidimensional scaling (MDS) has been performed with uniform 
weighting of the individuals (see e.g. Mardia et al. 1979) to obtain the fac- 
torial coordinates expressing a maximum amount of the overall dispersion 
A/ (Figure^]). With D fg = 1.98, A f = 3.67 and A g = 1.89, the polarization 
ratio is |e+ — e_| = 2.72. 



2.5 Complementary group 

By construction (Section 12. 2h . the in- focus a_ is the point of view under 
which group g appears, relatively to the ground or context formed by the 
complete set of individuals /, as homogeneous as possible, and turns out to 
constitute a credible candidate for representing a stereotypical value. By 
contrast, the out-focus a+ is the point of view maximally respectful of the 
features diversity in g, and behaves as an antitype - in the sense of "anti- 
stereotypical" . 

In the example of Figure the out-focus a + of the Republicans seems to 
be equally qualified to represent the in-focus of the Democrats (not drawn 
on the Figure). As a matter of fact, the two points coincide, as justified by 
Definition [2] and Theorem [3] below. 

Definition 2 The group complementary to group g in context f is defined 
by a normed distribution g, which, mixed with g, reproduces f, in the sense 
pg + (1 — p)g = f , or equivalently 

ffi = -4 for some p <E (0, /9 max ] with p max := mm(fi/gi) . 

1 — p i 

(10) 

1 http:/ /archive. ics.uci.edu/ml/machine-learning-databases/voting-records/ 
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Figure 2: First MDS coordinates obtained from the squared distances be- 
tween U.S. Congressmen (legislature 1984), expressing 58% of the overall 
dispersion Af. Red circles depict the positions of the Republicans, and blue 
circles those of the Democrats, xt is the overall centroid, x g is the Repub- 
licans centroid, Xg is the Democrats centroid. a_ is the in-focus relative to 
the Republicans, and a + the corresponding out-focus; see also Section 12.51 



The definition of /9 max insures the non- negativity of g. For instance, the 
Democrats group g in ([9]) is complementary to the Republicans group g 
since pg + (1 — p)g = f with p = nn/n = .39, which turns out to be equal 
to its maximum value p max . 

Theorem 3 The out-focus point a 9 + for group g is the in-focus point a g _ of 
any group g complementary to g. 

Proof: substituting / = pg + (1 — p)g in ([2]) and developing the first 
identity demonstrates = p(A g + D ga ) + (1 — p)(Ag + Dg a ), that is 
p S(a\g, f) + (1 — p) S(a\g,f) = 1. Hence, p, f and g being fixed, maxi- 
mizing 5(a\g, f) amounts in minimizing 5(a\g,f). □ 
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Note the identity 



5'(a) = 5{a\ 9 ,g) = % = ^ f ^- (11) 

g 

which demonstrates that extremalizing the relative dispersion 5(a) = 5(a\g, f) 
or its variant 5' (a) yields the same solutions. 



3 Further connections 



3.1 Decision theory 

The following argument constitutes a first attempt towards a derivation 
of the in-focus in a decision-theoretical framework. Consider the deci- 
sion rule "attribute individual i either to group g with probability P(g\i) = 
exp(— fiDia) or to the overall set f with probability P(f\i) = 1 — P(g\i)", 
where /3 > is a parameter controlling the decay of the exponential and a a 
point to be chosen wisely. The probability of misidentifying an individual of 
g (miss) is P(f\g) = J2i 9iP(f\i)j aricl the probability of correctly identifying 
an individual of / as such (correct rejection) is P(f\f) = fiP(f\i)- In 
the limit of large spread, the ratio of these quantities becomes 

S^oP(co.re.) 9-*>P(/|/) S-.0 ff^i /iA» + 0(/3 2 ) AJ V ' 

In this context, the probability of miss is minimized by the group centroid x g , 
while the ratio of the probabilities "miss over correct rejection" is minimized 
by the in-focus a_ - a somewhat intriguing result to be further investigated. 



3.2 Subtractive combinations 

Instead of studying the relative dispersion ratio ([3]), one can consider the 
subtractive combination of the form 

7(a) = A A a g - B A a f A,B €R . (12) 

If A + B 7^ 0, the two parameters can be normalized as A = 1 — A and 
B = X. For A ^ 0.5, 7(a) possesses a unique bounded extremum at e = 
(A — 1)/(1 — 2A) (following parameterization ([%])), which turns out to be a 
minimum for A < 0.5 and a maximum for A > 0.5; no bounded extremum 
exists for A = 0.5. If A + B = 0, 7(a) possesses a unique bounded extremum 
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at the mid-point e = —0.5, which constitutes a minimum for A > and a 
maximum for A < 0. In any case, the position of the extremum does not 
depend upon the dispersions A/ and A g . Similar results are obtained when 
replacing / by g in (fl~2jh 

3.3 Meta-contrast ratio and prototypicality function 

More interesting, and considerably more involved is the study of the follow- 
ing function, appearing in the framework of the self- categorization theory 
(where groups are not given a priori), proposed by Salzarulo (2006), and 
referred to him (up to a sign) as the prototypicality function: 

r(a) = (l-A)A« (a) -AA| (a) A €[0,1] (13) 

Here gi(a) = exp(— /3Dj a )/Z(a), where Z{a) is the normalization constant: 
in this approach, the very composition of group g depends on the distance of 
its constituents to a. Also, g(a) is of the form (fTUj) with p(a) = Z(a)/n < 1. 

The function f 1 1 3 [) is primarily meant as an improved variant of the meta- 
contrast ratio (Haslam and Turner 1995; Turner et al. 1987; Oakes et al. 
1994), measuring the relative differences between individuals, and aimed at 
predicting to which extent a given individual will be perceived as belonging 
to the subject group. 

The highly non-linear properties of of T(a), whose minima are interpreted 
as prototypical positions, can be built on to run dynamical numerical simula- 
tions exhibiting groups formation and destruction, in the context of opinion 
formation, for various values of A and (3. In particular, two agents initially 
categorizing themselves as different can perceive themselves as belonging to 
the same group in presence of a third agent distant from them; also, fit- 
ting experimental data is possible, as those of Haslam and Turner (1995), 
satisfactorily reproduced with A = .08 and (3 = 7.7, on a one-dimensional 
opinion space x 6 [0, 1]. See Salzarulo (2006) for more details. 

4 Discussion and conclusion 

The mechanism relating the minimization of the relative dispersion to the 
polarization of the in-focus is entirely mathematical, and relevant to the con- 
struction of statistically biased, context-dependent measures of central (or 
"typical") tendency. However, the parallel with a few predominant themes 
of Social Psychology seems striking, and we did not resist the temptation to 
interpret the in-focus as a stereotype, and the statistical group g as an out- 
group. The extent to which the metaphor is legitimate is be judged within 
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Social Psychology. Among the points potentially stimulating, let us mention 
the question of the identification of the ingroup, of which both the context 
/ and the complementary g are legitimate candidates - with similar if not 
identical effects, in view of Theorem [3] and (jlip . 

The formalism we have used is both general, that is using weighted 
groups allowing fuzzy memberships, and classical, that is using squared Eu- 
clidean distances as measures of dissimilarities. Euclidean distances permit, 
in contrast to other dissimilarities, to extract the original features through 
MDS (up to a rotation in the features space); they furthermore additively 
decompose accordingly to Huygens principles, the use of which has been 
crucial in the present paper. 

Non-Euclidean dissimilarities, to which alternatives measures of central 
tendency are associated, such as the trimmed mean or the median (e.g. Ham- 
pel et al. 1986), are perfectly legitimate, and possibly better justified form 
robustness considerations. The resuting polarization of the in- focus would 
certainly deserve proper studies, which are however bound to be technically 
more involved. 
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